Skip to content

Skip gnome_configuration and RSPEED-1998 tests#7

Merged
emac-E merged 1 commit into
emac-E:mainfrom
Lifto:chore/skip-ambiguous-tests
Apr 9, 2026
Merged

Skip gnome_configuration and RSPEED-1998 tests#7
emac-E merged 1 commit into
emac-E:mainfrom
Lifto:chore/skip-ambiguous-tests

Conversation

@Lifto

@Lifto Lifto commented Apr 9, 2026

Copy link
Copy Markdown

Summary

  • Skip gnome_configuration — vague expected response, LLM gives correct answers but judge scores inconsistently
  • Skip RSPEED-1998 (Kea DHCP) — version-ambiguous query causes transient failures unrelated to retrieval

Both include skip_reason documenting why and how to fix. Depends on #6 for the skip field.

Changes

  • rhel10_documentation.yaml: Split gnome_configuration into its own conversation and skip it
  • jira_incorrect_answers.yaml: Skip RSPEED-1998

gnome_configuration: vague expected response causes inconsistent scoring.
RSPEED-1998: version-ambiguous query — LLM randomly picks RHEL 9 vs 10.

@emac-E emac-E left a comment

Copy link
Copy Markdown
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK we can make this default behaviour until we figure out how to word things better and address stability of the LLM answer quality.

@emac-E emac-E merged commit f5edde9 into emac-E:main Apr 9, 2026
5 of 15 checks passed
emac-E pushed a commit that referenced this pull request Apr 10, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants